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"VIDEO CODING AND DECODING METHOD" 



PHFR020144 



FIELD OF THE INVENTION 

The present invention generally relates to the field of video compression 
and, more particularly, to the video coding standards of the MPEG family and to the 
video coding recommendations of the 1TU-H.26X family. More precisely, it relates to a 
video coding method applied to an original video sequence in which the successive 
frames Include one or several arbitrarily shaped video objects (VOs) defined by their 
texture and motion components and an additional shape component, and to a 
corresponding decoding method. 

BACKGROUND OF THE INVENTION 

In the first video standards and recommendations (up to MPEG-2 and 
H.263 respectively), the video, assumed to be rectangular, was described in terms of 
three separate channels : one for luminance and two for chrominance (this three- 
channels based representation scheme has also been used with other compression 
schemes like mesh-based approaches). However, artifacts appear when a scene that 
has to be coded and transmitted and/or stored is composed of several objects with 
Independent movements, especially each time there is a spatio-temporal discontinuity 
(these areas then need to be specifically treated and refined). 

With the MPEG-4 standard, an additional channel has been Introduced : the 
alpha channel (also referred to as the "arbitrary shape channel" in MPEG-4 terminology). 
This alpha channel allows to describe independently the contour (or shape) of each video 
object (VO) present in the concerned scerieand consequently makes it possible to encode 
separately objects while avoiding discontinuities along boundaries of said objects. However, 
a drawback of such a technique Is the waste of bits which is encountered in the cost of the 
overhead required to describe this shape channel. 

SUMMARY OF THE INVENTION 

It is therefore an object of the invention to propose a coding method with which 
said drawback is avoided. 

To this end, the invention relates to a video coding method such as defined in the 
introductory paragraph of the description, said method further comprising the following 
steps : 

(a) a non object-oriented coding step, applied to a small number of frames of the video 

sequence ; 

(b) an object-oriented coding step, applied to all the frames of the sequence that follow 
said small number of frames ; 
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(c) a sequencing step, provided for controlling that said non object-oriented and object- 
oriented coding steps are respectively applied to the appropriate frames, in order to generate a coded 
bitstream including non object-oriented coded data corresponding to said small number of frames 
followed by object-oriented coded data corresponding to said following frames. 

It is also an object of the invention to propose a video decoding method applied to a 
coded bitstream corresponding to an original video sequence In which the successive frames include 
one or several arbitrarily shaped video objects (VOs) defined by their texture and motion components 
and an additional shape component and have been coded by means of a video coding method 
comprising the following steps : 

(a) a non object-oriented coding step, applied to a small number of frames of the video 

sequence ; 

(b) an object-oriented coding step, applied to all the frames of the sequence that follow 
said small number of frames ; 

(c) a sequencing step, provided for controlling that said non object-oriented and object- 
oriented coding steps are respectively applied to the appropriate frames, in order to generate a coded 
bitstream including non object-oriented coded data corresponding to said small number of frames 
followed by object-oriented coded data corresponding to said following frames ; 

said decoding method itself comprising the following steps : 

(1) a first decoding step, applied to said non object-oriented coded data of the coded 
bitstream that correspond to said small number of frames of the original video sequence ; 

(2) a spatio-temporal segmentation step applied to said non object-oriented coded data 
of the coded bitstream that correspond to said small number of frames and provided for 
reconstructing the missing shape component of the VOs ; 

(3) a second decoding step, applied to said object-oriented coded data of the coded 
bitstream that correspond to said following frames ; 

(4) a sequencing step, provided for controlling that said decoding and segmentation steps 
are respectively applied to the appropriate frames. 

DETAILED DESCRIPTION OF THE INVENTION 

Many documents, and for instance the document US 6026195, describe an 
object-oriented video encoding method and device according to the MPEG-4 standard, A 
video input Is analyzed and subdivided into video objects (VOs), each of which is defined 
by three components : shape, motion and texture. The encoding device Includes a shape 
encoder, which encodes a particular representation of the shape of each object, a texture 
encoder, which encodes a representation of the texture of each VO, and a motion encoder, 
which encodes a representation of the motion of each VO. 

Signals representative of the encoded shape, texture and motion of the VOs 
are then sent to a multiplexer and transmitted over a channel or stored in a recording 
medium such as a database, for a future use, in order to be at a later time received by a 
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demultiplexer, that separates the received coded data, and a decoding device. Said 
decoding device in turn indudes a shape decoder, a texture decoder and a motion decoder, 
the outputs of which are sent to a reconstruction device, for instance a compositor (such as 
a personal computer located at a user's home). In said reconstruction device, the VOs are 
reformed into a video image, and the sequence of video images thus reformed can be 
output (for example, displayed or stored in a video library). 

With respect to such a known system, the principle of the invention is to 
modidy both the encoding and decoding parts by performing on the concerned input 
sequence a segmentation both at the encoding and decoding sides. In view of the 
implementation of said principle, a sequencing module is added in the encoding device, in 
order to force the following operations : 

(a) for a small number of frames (or Images) of the sequence, and preferably 
only the two first ones, the shape component of the VOs is not transmitted : the object- 
oriented coding mode is not chosen for these two first images, and there two images are 
coded according to a non object-oriented coding mode, for example according to a block- 
based mode, as if they were one single, rectangular object (this mode Is here called 
"classical"), or a mode based, on a wavelet decomposition ; 

(b) the following frames (i.e. the third one, the fourth one, etc, if only two 
frames have been considered) of the sequence are again coded using the object-oriented 
coding mode, however without transmitting any shape component. 

In the decoding device, a sequencing module Is correspondingly provided in 
order to carry out the following operations ? 

(a) the non object-oriented coded data corresponding to the two first images 
are "classically" decoded by means of a firstdecoding step (i.e., as seen above, according 
for example to the block-based mode or the wavelet-based mode) ; 

(b) a spatio-temporal segmentation step is carried out, based on these two 
3ss£ images; 

(c) the object-oriented coded data corresponding to the so-called following 
images (i.e all the images except the two first ones) are decoded according to the object- 

* oriented decoding mode by means of a second decoding step, the shape information for 
each VO being obtained thanks to the spatio-temporal segmentation process provided in 
tbe decoding device. 

With this technical solution, an object-based processing can be achieved 
without encoding the shape information, and it thus avoids a waste of bits. 

It must be noted that this disclosure is illustrative and that the method 
according to the present invention is not limited to the aforesaid implementation. The 
sH^*>enLdUon process may for Instance be slightly improved by transmitting in the coded 
aartream at the picture level an Information on the number of regions of interest (i.e. of 
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VOs). In this manner, the decoding device can adjust the segmentation step In order to 
obtain exactly the same segmentation that the one at the encoder side* 
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CLAIMS 

1. A video coding method applied to an original video sequence in which the successive 

frames include one or several arbitrarily shaped video objects (VOs) defined by their texture and 
motion components and an additional shape component said method comprising the following steps 

(a) a non object-oriented coding step, applied to a small number of frames of the video 

sequence; 

(b) an object-oriented coding step, applied to all the frames of the sequence that follow 
said small number of frames ; 

(c) a sequencing step, provided for controlling that said non object-oriented and object- 
oriented coding steps are respectively applied to the appropriate frames, in order to generate a coded 
bitstream including non object-oriented coded data corresponding to said small number of frames 
followed by object-oriented coded data corresponding to said following frames! 

2. A coding method according to daim 1, in which said number of frames is equal to two. 

3. A coding method according to daim 1 or 2, wherein said coded bitstream also includes an 
information about the number of regions of interest in the original video sequence. 

4. A coding method according to claim 3, wherein said information about the number of 
regions of interest is given at the picture level. 

5. A video decoding method applied to a coded bitstream corresponding to an original video 
sequence in which the successive frames include one or several arbitrarily shaped video objects (VOs) 
defined by their texture and motion components and an additional shape component and have been 
coded by means of a video coding method comprising the following steps : 

(a) a non object-oriented coding step, applied to a small number of frames of the video 

sequence ; 

(b) an object-oriented coding step, applied to all the frames of the sequence that follow 
said small number of frames ; 

(c) a sequencing step, provided for controlling that said non object-oriented and object- 
oriented coding steps are respectively applied to the appropriate frames, in order to generate a coded 
b.tstream induding non object-oriented coded date corresponding to said small number of frames 
followed by object-oriented coded data corresponding to said following frames ; 

said decoding method itself comprising the following steps : 

(1) a first decoding step, applied to said non object-oriented coded date of the coded 
bitstream that correspond to said small number of frames of the original video sequence ; 

(2) a spatio-temporal segmentation step applied to said non object-oriented coded data 
of the coded bitstream that correspond to said small number of frames and provided for 
reconstructing the missing shape component of the VOs ; 

(3) a second decoding step, applied to said object-oriented coded data of the coded 
bitstream that correspond to said following frames ; 

(4) a sequendng step, provided for controlling that said decoding and segmentation steps 
are respectively applied to the appropriate frames. 
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Abstract 

The Invention relates to a video coding method applied to an original video sequence in 
which the successive frames include one or several arbitrarily shaped video objects (VOs) defined by 
their texture and motion components and an additional shape component According to the invention, 
said method comprises a non object-oriented coding step, applied to a small number of frames of the 
video sequence, an object-oriented coding step, applied to all the frames of the sequence that follow 
said small number of frames, and a sequencing step, provided for controlling that said non object- 
oriented and object-oriented coding steps are respectively applied to the appropriate frames, in order 
to generate a coded bitstream including non object-oriented coded data corresponding to said small 
number of frames followed by object-oriented coded data corresponding to said following frames. The 
invention also relates to a corresponding video decoding method. 
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